Cluster Analysis with Application to Geochemical Data
نویسندگان
چکیده
The goal of this presentation is to give a deeper insight into the large data sets of the C-horizon and the O-horizon soil samples with the help of cluster analysis and fuzzy cluster analysis. These mineral soil samples were taken from a big area in the course of a regional study in the European Arctic. An overview of the Kola data project can be found on the web site http://www.ngu.no/Kola. The aim of cluster analysis is to find groups in data, in which objects of the same group should be as similar to each other as possible, whereas objects of different groups should be as dissimilar as possible. First we want to explain the steps necessary for doing cluster analysis. In the literature many methods of clustering algorithm are described. There are some characteristics which should be the basis for the choice of the clustering algorithm. The most widely used clustering techniques will be explained in more detail. The procedure of evaluating the results of a clustering algorithm is known under the term cluster validity. We distinguish between external, internal and relative criteria. With validity measures one can find the optimal number of clusters. Moreover, they are helpful for making statements about the structure of the existing data and for the selection of an appropriate clustering algorithm. A good graphical presentation of cluster results is important for the interpretation. Whereas objects can easily be shown in maps using different colors or grey-scale, the presentation of the variables that are influential for each cluster is not trivial. We propose a new plot based on Reimann et al. (2000), which shows the degree of average concentrations of the elements in the clusters. Finally, we will present some cluster results. The clusters of the C-horizon reflect the various groups of rock types in this area. For the O-horizon we can localize environmental pollution from heavy industry and recognize the influence of weather and sea.
منابع مشابه
Application of C-A fractal model and exploratory data analysis (EDA) to delineate geochemical anomalies in the: Takab 1:25,000 geochemical sheet, NW Iran
Abstract Most conventional statistical methods aiming at defining geochemical concentration thresholds for separating anomalies from background have limited effectiveness in areas with complex geological settings and variable lithology. In this paper, median+2MAD as a method of exploratory data analysis (EDA) and concentration-area (C-A) fractal model as two effective approaches in separation g...
متن کاملGeochemical Evaluation of Drinking Water in Arak City, Iran
This paper presents results of an assessment of dominant hydro-geochemical processes controlling groundwater chemical composition, using an integrated application of cluster analysis and factor analysis. The area is located in south of saline playa and in Arak city. Cluster analysis classified samples into two main clusters according to their dominant chemical composition: cluster A (dominant c...
متن کاملExploration of Kahang porphyry copper deposit using advanced integration of geological, remote sensing, geochemical, and magnetics data
The purpose of mineral exploration is to find ore deposits. The main aim of this work is to use the fuzzy inference system to integrate the exploration layers including the geological, remote sensing, geochemical, and magnetic data. The studied area was the porphyry copper deposit of the Kahang area in the preliminary stage of exploration. Overlaying of rock units and tectonic layers were used ...
متن کاملUsing stream sediment data to determine geochemical anomalies by statistical analysis and fractal modeling in Tafrash Region, Central Iran
Iranian Cenozoic magmatic belt, known as Urumieh-Dokhtar, is recognized as an important polymetallic mineralization which hosts porphyry, epithermal, and polymetallic skarn deposits. In this regard, multivariate analyses are generally used to extract significant anomalous geochemical signature of the mineral deposits. In this study, stepwise factor analysis, cluster analysis, and concentration–...
متن کاملApplication of continuous restricted Boltzmann machine to detect multivariate anomalies from stream sediment geochemical data, Korit, East of Iran
Anomaly separation using stream sediment geochemical data has an essential role in regional exploration. Many different techniques have been proposed to distinguish anomalous from study area. In this research, a continuous restricted Boltzmann machine (CRBM), which is a generative stochastic artificial neural network, was used to recognize the mineral potential area in Korit 1:100000 sheet, loc...
متن کاملChemometrics-enhanced Classification of Source Rock Samples Using their Bulk Geochemical Data: Southern Persian Gulf Basin
Chemometric methods can enhance geochemical interpretations, especially when working with large datasets. With this aim, exploratory hierarchical cluster analysis (HCA) and principal component analysis (PCA) methods are used herein to study the bulk pyrolysis parameters of 534 samples from the Persian Gulf basin. These methods are powerful techniques for identifying the patterns of variations i...
متن کامل